COMPUTATIONAL LOWER BOUNDS FOR COLOURFUL 

SIMPLICIAL DEPTH 



ANTOINE DEZA, TAMON STEPHEN, AND FENG XIE 



Abstract. The colourful simplicial depth problem in dimension d is to find a config- 
uration of {d+1) sets of (d-l- 1) points such that the origin is contained in the convex 
hull of each set (colour) but contained in a minimal number of colourful simplices gen- 
erated by taking one point from each set. A construction attaining + 1 simplices 
is known, and is conjectured to be minimal. This has been confirmed up to d = 3, 
however the best known lower bound for c? > 4 is f-^^^-y^] . 

A promising method to improve this lower bound is to look at combinatorial oc- 
tahedral systems generated by such configurations. The difficulty to employing this 
approach is handling the many symmetric configurations that arise. We propose a ta- 
ble of invariants which exclude many of partial configurations, and use this to improve 
the lower bound in dimension 4. 



1. Introduction 

A colourful configuration is the union of (rf -|- 1) sets, or colours, Sq, Si, . . . , S,^ of 
(d+l) points in R'^. We are interested in the colourful simplices formed by taking the 
convex hull of a set containing one point of each colour. The colourful simplicial depth 
problem is to find a colourful configuration, with each Sj containing the origin in the 
interior of its convex hull, minimizing the number of colourful simplices containing 0. 
We denote this minimum by 

Computing fi{d) can be viewed as refining Barany's Colourful Caratheodory The- 
orem |Bar82] whose original version gives i.i{d) > 1, and fi{d) > d + 1 with a useful 
modification. The question of computing ii{d) was studied in |DHST06] . which showed 
/i(2) = 5, that 2d < fj,{d) < d^ + 1 for d> 3 and that fi{d) is even when d is odd. The 
lower bound has since been improved by |BM07] . who verified the conjecture for d = 3, 

|ST08j and |DSXllj which includes the current strongest bound of fi{d) > f^^^^-y^] for 
d>4. 

One motivation for colourful simplicial depth is to establish bounds on ordinary 
simplicial depth. A point p & has simplicial depth k relative to a set S if it is 
contained in k closed simplices generated by (d+l) sets of S. This was introduced by 
Liu |Liu90] as a statistical measure of how representative p is of S. See |GrolOt IMW12t 
IKarl2l IKMS12] for recent progress on this problem. We remark also that the colourful 
simplicial depth of a point is the number of solutions to a colourful linear program in 
the sense of |B097j and |DHST08j. 

Our strategy, following |CDSX11] . is to show that a particular hypergraph whose 
edges correspond to the colourful simplices containing in a configuration cannot exist. 
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1.1. Preliminaries. A colourful configuration is a collection of (d+l) sets Sq, Si, . . . , 
of {d + 1) points in M'^. Let S = U^^qSj. Without loss of generality we assume that 
the points in S U {0} are in general position. Recall that fi{d) represents the minimal 
number of colourful simplices generated by one point from each Sj and containing in 
any d-dimensional colourful configuration with G nf^l conv(Sj). 

We take the simplices to be closed and remark that the minimum should be attained. 
Barany's Colourful Caratheodory Theorem |Bar82] gives that some colourful simplex 
containing exists in a colourful configuration with G rif^^ conv(Sj), and in fact that 
every point in S is part of some such colourful simplex containing 0. This guarantees 
that yu((i) > d+l. A colourful configuration of |DHST06] in M.'^ has only d'^ + 1 colourful 
simplices containing 0, thus yu((i) < + 1. It is known that //(I) = 2 (trivial), ^(2) = 5 
IDHSTOBj . /i(3) = 10 IBMOTj and that > f^^] for d > 4 [DSXllj . 

A colourful configuration defines a l)-uniform hypergraph onS = uf+lSi by tak- 
ing edges corresponding to the vertices of containing colourful simplices. We will call 
a hypergraph that arises from a colourful configuration with G H^^/^ conv(Sj) a config- 
uration hypergraph. The Colourful Caratheodory Theorem gives that any configuration 
hypergraph must satisfy: 

Property 1.1. Every vertex of a configuration hypergraph belongs to at least one of its 
edges. 

Fix a colour i. We call a set t of d points that contains exactly one point from each Sm- 
other than Sj an z — transversal. That is to say, an z— transversal t has i fl Sj = and 
\tr\ Sj\ = 1 for i 7^ j. We call any pair of disjoint i— transversals an i- octahedron; these 
may or may not generate a cross-polytope (rf- dimensional octahedron) in the geometric 
sense that their convex hull is a cross-polytope with same coloured points never adjacent 
in the skeleton of the polytope. 

A key property of colourful configurations is that for a fixed i-octahedron ft, the 
parity of the number of colourful simplices containing formed using points from ft 
and a point of colour i does not depend on which point of colour i is chosen. This is a 
topological fact that corresponds to the fact that is either "inside" or "outside" the 
octahedron, see for instance the Octahedron Lemma of |BM07] for a proof. Figure [1] 
illustrates this in a two dimensional case where is at the centre of a circle that contains 
points of the three colours. 




Figure 1. Two-dimensional cross-polytopes 17 containing and not. 

We carry the definitions of i- transversals and i-octahedra over to the hypergraph 
setting. Then any hypergraph arising from a colourful configuration must satisfy: 
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Property 1.2. For any octahedron fl of a hypergraph arising from a colourful config- 
uration, the parity of the set of edges using points from Q and a fixed point Sj for the 
ith coordinate is the same for all choices of Si. 

Consider a hypergraph whose vertices are S = Uf^gS* ^'^d whose edges have exactly 
one element from each set. If the hypergraph satisfies Property 11.21 we call it an oc- 
tahedral system, if it additionally satisfies Property 11.11 we call it a octahedral system 
without isolated vertex. A colourful configuration with G ^'1=1 conv(Sj) and k colour- 
ful simplices containing has a configuration hypergraph that is an octahedral system 
without isolated vertex with k edges. Then any lower bound for the number of edges in 
an octahedral system without isolated vertex with {d+ 1) colours is also a lower bound 
v{d) for ^{d). It is an interesting question whether there are any octahedral systems 
without isolated vertex not arising from any colourful configurations, and if not whether 
v{d) < fi{d) for some d. This purely combinatorial approach was originally suggested 
by Barany. 

2. A TABLE OF INVARIANTS 

Octahedral systems have the advantage of being combinatorial and finite. In principle 
for any particular d and k we can check if there exists an octahedral system without 
isolated vertex on S = Uf^g'^j with up to k edges by generating all the (finitely many) 
hypergraphs with up to k edges, each containing one element from each Sj and then 
testing if they satisfy Properties 11.11 and 11.21 The difficulty, of course, lies in the sheer 
number of such hypergraphs, and in verifying Property 11.21 efficiently. 

We can reduce the search space by exploiting the many combinatorial symmetries in 
such hypergraphs and considering only configurations that satisfy certain normaliza- 
tions. However, this alone is not sufficient to improve the known lower bounds even 
for d = A. We thus turn our attention to how to use Property 11.21 effectively. Since a 

given configuration has very many octahedra, in fact {d -\- 1) (2) , we expect that most 
unstructured hypergraphs fail Property 11.21 for many octahedra. 

There are so many octahedra that it would be difficult to verify Prop erty 1 1 . 2 1 explicitly 
for non-trivial octahedral systems. However, we can often quickly detect violations of 
Property 11.21 since hypergraphs which are not octahedral systems may fail Property 11.21 
for most octahedra. Even more, we will show that certain partial systems cannot satisfy 
Property 11.21 even with the addition of a number of edges. 

2.1. The large table. We begin by fixing an arbitrary colour as colour and an 
arbitrary 0-transversal. We can label the points in each set from to and without 
loss of generality take the transversal to contain the point of each set. For convenience 
we write edges as a string of (d+l) numbers and transversals as string of d numbers with 
* corresponding to the omitted colour. Thus the 0-transversal considered is *00 ... 0. 

There are d'^ possible octahedra formed by choosing a second 0-transversal disjoint 
from *00 ... 0. For a hypergraph of colourful edges on S, we generate a d'^ x {d -\- 1) 
table whose rows are indexed by the octahedra containing *00 ... and whose columns 
are indexed by the points of colour 0. The entries of the table are the parity of the 
number of edges using points from the octahedron corresponding to the row and the 
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point of colour corresponding to the column. We call this the large table. We observe 
that Property 11.21 implies that if the hypergraph is an octahedral system, the rows of 
the large table must be constant - either all zeros or all ones. 

For any hypergraph if we build the large table, by taking the view that the entries in 
the column are correct we can produce a score for the hypergraph by simply counting 
the entries in the remaining columns that do not agree with the column entry. So 
any octahedral system will have a score of 0, while the hypergraph that consists only 
of the edge 000 ... has the maximum possible score of cfi^^. Let z{e) be the number 
of zeros in hypergraph edge e. Then we have: 

Lemma 2.1. Adding or deleting edge e to an octahedral system changes its score by at 
most d^^^\ 

Proof. Consider first the case where the element of colour of e is j 7^ 0. Then the 
only entries of the large table that change are those in column j. Further, an entry will 
change if and only if the remaining elements of e lie in the octahedron corresponding to 
the row. Then the affected rows are those that whose non-zero transversal agrees with 
the non-zero elements of e. For zero elements of e any choice of transversal element is 
allowed. This gives the requisite number of affected entries. 

In the case where the 0th element of e is 0, elements of column are changed, 
thus changing the correctness of the remaining d entries in the affected rows, which 
themselves do not change. The number of rows affected again depends on the number 
of zero elements in the remainder of e. This is one less than the total number of zero 
elements in e, again giving the requisite number of affected entries. □ 

Lemma 12.11 allows us to make the following useful observation. Call an edge e of 
a hypergraph isolated if there is no other edge that differs from e only in a single 
coordinate. Then: 

Lemma 2.2. An octahedral system with d"^ or fewer edges must not contain any isolated 
edges. 

Proof Without loss of generality we take the isolated edge to be 000 ... and consider 
the large table formed. The hypergraph consisting of the unique edge 000 ... has score 
]sjq-^ keep track of the score as additional edges are added one at a time. Since 
000 ... is isolated, all subsequent edges have at most d — 1 zeros, and thus reduce the 
score by at most d^~^. So at least = d'^ additional edges are required to reduce the 
score to zero. The claim follows. □ 

2.2. The small table. For some cases that we are interested in it is possible to compute 
the score for the large table in reasonable time, but for the purpose of quickly showing 
that a given hypergraph is not an octahedral system (and may be far from one), it is 
effective to initially focus on a subset of the rows. One such small subset are the d 
rows generated by transversals *11 ... 1, *22 ... 2, . . ., *dd . . .d. We call the restriction 
of the large table to these d rows the small table. Note that the initial numberings are 
arbitrary, and the composition of the small table depends on this numbering, which we 
may fix later as part of our search algorithm. 
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The advantage of focusing on this d x {d + 1) table is that the entries are relatively 
independent. Only edges of the form xOO ... can change more than one entry of this 
table. After accounting for such edges, each entry can be only be affected by the 2'^ — 1 
edges that are on the relevant octahedron with the given final coordinate. 

3. Enumeration Strategy and Computation 

In this section we describe how the small and large tables, combined with symmetry 
breaking strategies, can be used to search efficiently for small octahedral systems. This 
strategy was implemented [Xiel2] in Python version 2.6 on an AMD Opteron Processor 
8356 core (2.3G Hz) and is able to prove that /i(4) > 14 in about 30 days of CPU time. 
This improves by 1 the bound of |DSXllj . 

Our strategy is enumerative: we try to build an octahedral system without isolated 
vertex by adding one edge at a time. At the start we reduce our choices by using 
symmetry. We then seek to add edges that are required by the small table. Subsequently 
we seek to add edges that are required by Lemma 12.21 and only as a last resort do 
we consider arbitrary edges. As we branch, we attempt to quickly identify partial 
configurations that cannot extend to a sufficiently small octahedral system. 

3.1. Initial assumptions. We begin by fixing an initial colour and transversal to = 
*00 ... 0; the tables we consider are with respect to to- We then use the results of 
|DSX11] . to break the problem into several cases based on I, the number of edges 
containing to, b the number of the small table octahedra that have odd parity, and j 
the minimum number of transversals covering any point of colour 0. Recall that the 
small table octahedra are those formed by to with each of tj = . . A for i = 1, 2, . . . , (i. 

It is clear that for any octahedral system without isolated vertex and with d'^ or fewer 
edges we must have 1 < l,b,j < d and that the number of edges is at least j{d + 1). 
Further, [DSXll] shows that we must have j + b > d-\-l, and that the number of edges 
must be at least {b + l){d + 1) — 2bl, as well as at least dl + 1 assuming that the colour 
is chosen to minimize / and that / > This last fact allows us to assume that 

/ < d — 1 if we have an octahedral system without isolated system with less than d"^ + 1 
edges. 

3.2. Details of enumeration. In the following we describe an enumeration that ex- 
cludes ^(4) = 13. This improves the bound of |DSX11] by 1. To rule out possible 
octahedral systems without isolated vertex of size 13 (or 14), it is sufficient to con- 
sider cases where j = 1 or j = 2. which in turn means 6 = 3 or 6 = 4. In the case 
6 = 3, we have at least 15 — / simplices, so Z = 2 or / = 3, and in the case b = 4, we 
have 20 — 3/ so / = 3; we would need to consider additional cases for / and b to rule 
out 14. In summary, we need to rule out systems where the triple {l,b,j) is one of 
(3,4,2), (3,4,1), (3,3,2), (2,3,2). 

By reordering the points of colour 0, we can take the edges xOOOO to be in the system 
for < x < / — 1, and not in the system for I < x < 4. Consider the small table after 
including these edges with / = 3, illustrated in Table [H Now if 6 = 4, then we are 
requiring that the small table be comprised entirely of I's. So in this case the entries 
in the first 3 columns are correct, while the entries in the last 2 columns are incorrect. 
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12 3 4 


*1111 

*2222 
*3333 
*4444 


1110 
1110 
1110 
1110 



Table 1. The small table with / = 3 



For (/, b,j) = (3, 4, 2) we proceed to enumerate configurations as follows. Since / = 3, 
we include initial edges 00000, 10000, 20000. We then add edges to correct each of the 
8 entries of Table [1] which must be fixed to get the correct small table for 6 = 4. As 
previously remarked, adding any edge not of the form xOOOO will change only a single 
entry in the small table. For instance, the entry in the first row and fourth column can 
be changed only by an edge of the form Sabcd where a,b,c,d E {0,1}. Given that that 
30000 cannot be added to the configuration without changing /, there remain only 15 
possible edges that change the entry, and one must be in our configuration. In fact, by 
reordering the colours we can take it to be one of 31000, 31100, 31110 and 31111. 

We could continue to exploit symmetries in this way - for instance depending on 
which of the previous 4 edges is chosen, the next edge could be one of 4 to 7 edges 
fixing the next table entry. However, we did not do this so as to avoid extensive case 
analysis. Instead, we began branching on all 15 possible edges that switch a given table 
entry until the table is correct and the partial configuration has 11 edges. 

As we branch we check two simple predictors that may indicate that the configuration 
requires several more edges. First, we look for points that are not currently included 
in any edge. If some colour still has k uncovered points, then we require k additional 
edges. Second, since any vertex of colour must be covered by at least j edges, we 
look to see which points of colour are not contained in sufficiently many edges, and 
get a score k' by totaling the undercounts. At the same time, we may find that all 
vertices of colour are already covered by more than j edges (especially when j = 1), 
in which case the partial configuration no longer belongs to this subcase and can be 
excluded. Again, we require k' additional edges. If either k or k' is sufficiently large 
(in this case 3), then the current partial configuration cannot extend to an octahedral 
system without isolated vertex with less than 14 edges and is abandoned. 

Otherwise, we examine the configuration to see if it has an isolated edge. If it contains 
an isolated edge e, the by Lemma [221 if the configuration is to extend to an octahedral 
system without isolated vertex with less than 17 edges, it must include an edge adjacent 
to e. That is, it must contain e' differing from e only in a single coordinate. There are 
only 20 such edges so we can branch on them. We then repeat the process of applying 
predictors and looking for an isolated edge until we either find an octahedral system 
without isolated vertex with less than 14 edges, or all partial configurations with fewer 
edges are exhausted. 

If we do arrive at a partial configuration with no isolated edges, then as a last resort 
we may have to branch on all possible edges. However, this happens infrequently enough 
that the enumeration ends in a reasonable time. 
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The remaining cases, where (/, b, j) is (3, 4, 1), (3, 3, 2) or (2, 3, 2) are similar. Having 
exhausted all these cases, we conclude that /i(4) > 14. 

4. Conclusions and remarks 

Octahedral systems are intriguing combinatorial objects. Using the observation that 
configuration hypergraphs generate octahedral systems without isolated vertex, we pro- 
pose a computational approach to establishing lower bounds for colourful simplicial 
depth. A straightforward implementation of this improves the best known lower for 
4-dimensional configurations from /u(4) > 13 to //(4) > 14. 

We can ask several other questions about octahedral systems. We remark that the 
maximum cardinality octahedral system without isolated vertex is the set of all possible 
edges; if we have {d + 1) sets of cardinality [d + 1) it has size {d + 1)'^+-^. It is the 
hypergraph arising from the colourful configuration of points in M.'^ that places the sets 
Si, ... , Sd+i close to vertices Vi, . . . Vd+i respectively of a regular simplex containing 0. 

Question 4.1. Can all octahedral systems without isolated vertex on {d + 1) sets of 
{d + 1) points arise from colourful configurations in M*^? 

We conclude by mentioning that many aspects of colourful simplices are just begin- 
ning to be explored. For instance, the combinatorial complexity of a system of colour 
simplices is analyzed in jST12] . As far as we know the algorithmic question of com- 
puting colourful simplicial depth is untouched, even for d = 2 where several interesting 
algorithms for computing the monochrome simplicial depth have been developed. See 
for instance the survey |Alo06] . 
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